



Page 1 of 7 



OIPE 



ENTERED 



RAW SEQUENCE LISTING DATE: 11/13/2001 

PATENT APPLICATION: US/09/841,963 TIME: 15:37:56

Input Set : N:\Crf3\RULE60\09841963.txt 
Output Set: N:\CRF3\11132001\l841963.raw 

4 <110> APPLICANT: MUSC FOUNDATION FOR RESEARCH DEVELOPMENT 

6 <120> TITLE OF INVENTION: METHODS AND COMPOSITIONS FOR DIAGNOSIS AND TREATMENT 

7 OF CANCER BASED ON THE TRANSCRIPTION FACTOR ETS2 
9 <130> FILE REFERENCE: 9175-006-228 

11 <140> CURRENT APPLICATION NUMBER: 09/841,963 

12 <141> CURRENT FILING DATE: 2001-04-25 

14 <150> PRIOR APPLICATION NUMBER: 99/27805 

15 <151> PRIOR FILING DATE: 1999-11-23 
17 <160> NUMBER OF SEQ ID NOS: 34

19 <170> SOFTWARE: FastSEQ for Windows Version 3.0

21 <210> SEQ ID NO: 1 

22 <211> LENGTH: 2269 

23 <212> TYPE: DNA 

24 <213> ORGANISM: Homo sapiens 

26 <220> FEATURE: 

27 <221> NAME/KEY: CDS 

28 <222> LOCATION: (292)...(1698)

30 <400> SEQUENCE: 1 

31 ccgtttcctc ccctcccctc cactcggccg tccctccttc ctcctccctc ctccctcctc 60 

32 ctcccgctcc tgaagagcgc gccgcgtggg ggacggcccg gttacttcct ccagagactg 120 

33 acgagtgcgg tgtcgctcca gctcagagct cccggagccg cccggccagc gtccggcctc 180 

34 cctgatcgtc tctggccggc gccctcgccc tcgcccggcg cgcaccgagc agccgcgggc 240 

35 gccgagcagc caccgtcccg accaagcgcc ggccctgccc gcagcggcag g atg aat 297 

36 Met Asn 

37 1 
39 gat ttc gga atc aag aat atg gac cag gta gcc cct gtg gct aac agt 345
40 Asp Phe Gly Ile Lys Asn Met Asp Gln Val Ala Pro Val Ala Asn Ser
41 5 10 15

43 tac aga ggg aca ctc aag cgc cag cca gcc ttt gac acc ttt gat ggg 393

44 Tyr Arg Gly Thr Leu Lys Arg Gln Pro Ala Phe Asp Thr Phe Asp Gly

45 20 25 30 

47 tcc ctg ttt gct gtt ttt cct tct cta aat gaa gag caa aca ctg caa 441

48 Ser Leu Phe Ala Val Phe Pro Ser Leu Asn Glu Glu Gln Thr Leu Gln

49 35 40 45 50 

51 gaa gtg cca aca ggc ttg gat tcc att tct cat gac tcc gcc aac tgt 489

52 Glu Val Pro Thr Gly Leu Asp Ser Ile Ser His Asp Ser Ala Asn Cys

53 55 60 65 

55 gaa ttg cct ttg tta acc ccg tgc agc [...] [...] gtg atg agt caa gcc 537
56 Glu Leu Pro Leu Leu Thr Pro Cys Ser Lys Ala Val Met Ser Gln Ala
57 70 75 80

59 tta aaa gct acc ttc agt ggc ttc aaa aag gaa cag cgg cgc ctg ggc 585
60 Leu Lys Ala Thr Phe Ser Gly Phe Lys Lys Glu Gln Arg Arg Leu Gly
61 85 90 95

63 att cca aag aac ccc tgg ctg tgg agt [...] [...] cag gta tgc cag tgg 633
64 Ile Pro Lys Asn Pro Trp Leu Trp Ser Glu Gln Gln Val Cys Gln Trp
65 100 105 110

67 ctt ctc tgg gcc acc aat gag ttc agt [...] [...] aac gtg aat ctg cag 681
68 Leu Leu Trp Ala Thr Asn Glu Phe Ser Leu Val Asn Val Asn Leu Gln
69 115 120 125 130

71 agg ttc ggc atg aat ggc cag atg ctg tgt aac ctt ggc aag gaa cgc 729
72 Arg Phe Gly Met Asn Gly Gln Met Leu Cys Asn Leu Gly Lys Glu Arg
73 135 140 145

75 ttt ctg gag ctg gca cct gac ttt gtg ggt gac att ctc tgg gaa cat 777
76 Phe Leu Glu Leu Ala Pro Asp Phe Val Gly Asp Ile Leu Trp Glu His
77 150 155 160

79 ctg gag caa atg atc aaa gaa aac caa gaa aag aca gaa gat caa tat 825
80 Leu Glu Gln Met Ile Lys Glu Asn Gln Glu Lys Thr Glu Asp Gln Tyr
81 165 170 175

83 gaa gaa aat tca cac ctc acc tcc gtt cct cat tgg att aac agc aat 873
84 Glu Glu Asn Ser His Leu Thr Ser Val Pro His Trp Ile Asn Ser Asn
85 180 185 190

87 aca tta ggt ttt ggc aca gag cag gcg ccc tat gga atg cag aca cag 921
88 Thr Leu Gly Phe Gly Thr Glu Gln Ala Pro Tyr Gly Met Gln Thr Gln
89 195 200 205 210

91 aat tac ccc aaa ggc ggc ctc ctg gac agc atg tgt ccg gcc tcc aca 969
92 Asn Tyr Pro Lys Gly Gly Leu Leu Asp Ser Met Cys Pro Ala Ser Thr
93 215 220 225

95 ccc agc gta ctc agc tct gag cag gag ttt cag atg ttc ccc aag tct 1017
96 Pro Ser Val Leu Ser Ser Glu Gln Glu Phe Gln Met Phe Pro Lys Ser
97 230 235 240

99 cgg ctc agc tcc gtc agc gtc acc tac tgc tct gtc agt cag gac ttc 1065
100 Arg Leu Ser Ser Val Ser Val Thr Tyr Cys Ser Val Ser Gln Asp Phe
101 245 250 255

103 cca ggc agc aac ttg aat ttg ctc acc aac aat tct ggg act ccc aaa 1113
104 Pro Gly Ser Asn Leu Asn Leu Leu Thr Asn Asn Ser Gly Thr Pro Lys
105 260 265 270

107 gac cac gac tcc cct gag aac ggt gcg gac agc ttc gag agc tca gac 1161
108 Asp His Asp Ser Pro Glu Asn Gly Ala Asp Ser Phe Glu Ser Ser Asp
109 275 280 285 290

111 tcc ctc ctc cag tcc tgg aac agc cag tcg tcc ttg ctg gat gtg caa 1209
112 Ser Leu Leu Gln Ser Trp Asn Ser Gln Ser Ser Leu Leu Asp Val Gln
113 295 300 305

115 cgg gtt cct tcc ttc gag agc ttc gaa gat gac tgc agc cag tct ctc 1257
116 Arg Val Pro Ser Phe Glu Ser Phe Glu Asp Asp Cys Ser Gln Ser Leu
117 310 315 320

119 tgc ctc aat aag cca acc atg tct ttc aag gat tac atc caa gag agg 1305
120 Cys Leu Asn Lys Pro Thr Met Ser Phe Lys Asp Tyr Ile Gln Glu Arg
121 325 330 335

123 agt gac cca gtg gag caa ggc aaa cca gtt ata cct gca gct gtg ctg 1353
124 Ser Asp Pro Val Glu Gln Gly Lys Pro Val Ile Pro Ala Ala Val Leu
125 340 345 350

127 gcc ggc ttc aca gga agt gga cct att cag ctg tgg cag ttt ctc ctg 1401
128 Ala Gly Phe Thr Gly Ser Gly Pro Ile Gln Leu Trp Gln Phe Leu Leu
129 355 360 365 370

131 gag ctg cta tca gac aaa tcc tgc cag tca ttc atc agc tgg act gga 1449
132 Glu Leu Leu Ser Asp Lys Ser Cys Gln Ser Phe Ile Ser Trp Thr Gly
133 375 380 385

135 gac gga tgg gag ttt aag ctc gcc gac ccc gat gag gtg gcc cgc cgg 1497
136 Asp Gly Trp Glu Phe Lys Leu Ala Asp Pro Asp Glu Val Ala Arg Arg
137 390 395 400

139 tgg gga aag agg aaa aat aag ccc aag atg aac tac gag aag ctg agc 1545
140 Trp Gly Lys Arg Lys Asn Lys Pro Lys Met Asn Tyr Glu Lys Leu Ser
141 405 410 415

143 cgg ggc tta cgc tac tat tac gac aag aac atc atc cac aag acg tcg 1593
144 Arg Gly Leu Arg Tyr Tyr Tyr Asp Lys Asn Ile Ile His Lys Thr Ser
145 420 425 430

147 ggg aag cgc tac gtg tac cgc ttc gtg tgc gac ctc cag aac ttg ctg 1641
148 Gly Lys Arg Tyr Val Tyr Arg Phe Val Cys Asp Leu Gln Asn Leu Leu
149 435 440 445 450

151 ggg ttc acg ccc gag gaa ctg cac gcc atc ctg ggc gtc cag ccc gac 1689
152 Gly Phe Thr Pro Glu Glu Leu His Ala Ile Leu Gly Val Gln Pro Asp
153 455 460 465

155 acg gag gac tgaggtcgcc gggaccaccc tgagccggcc ccaggctcgt 1738
156 Thr Glu Asp

159 ggactgagtg ggaagcccat cctgaccagc tgcctccgag gacccaggaa aggcaggatt 1798
160 gaaaatgtcc aggaaagtgg ccaagaagca gtggccttat tgcatcccaa accacgcctc 1858
161 ttgaccaggc tgcctccctt gtggcagcaa cggcacagct aattctactc acagtgcttt 1918
162 taagtgaaaa tggtcgagaa agaggcaccg ggaagccgtc ctggcgcctg gcagtccgtg 1978
163 ggacgggatg gttctggctg tttgagattc tcaaaggagc gagcatgtcg bggacacaca 2038
164 cagactattt ttagattttc ttttgccttt tgcaaccagg aacagcaaat jcaaaaactc 2098
165 tttgagaggg taggagggtg ggaaggaaac aaccatgtca tttcagaagt tagtttgtat 2158
166 atattataat aatcttataa ttgttctcag aatcccttaa cagttgtatt taacagaaat 2218
167 tgtatattgt aatttaaaat aattatataa ctgtatttga aataagaatt 2269

169 <210> SEQ ID NO: 2
170 <211> LENGTH: 469
171 <212> TYPE: PRT
172 <213> ORGANISM: Homo sapiens
174 <400> SEQUENCE: 2

175 Met Asn Asp Phe Gly Ile Lys Asn Met Asp Gln Val Ala Pro Val Ala
176 1 5 10 15

177 Asn Ser Tyr Arg Gly Thr Leu Lys Arg Gln Pro Ala Phe Asp Thr Phe
178 20 25 30

179 Asp Gly Ser Leu Phe Ala Val Phe Pro Ser Leu Asn Glu Glu Gln Thr
180 35 40 45

181 Leu Gln Glu Val Pro Thr Gly Leu Asp Ser Ile Ser His Asp Ser Ala
182 50 55 60

183 Asn Cys Glu Leu Pro Leu Leu Thr Pro Cys Ser Lys Ala Val Met Ser
184 65 70 75 80

185 Gln Ala Leu Lys Ala Thr Phe Ser Gly Phe Lys Lys Glu Gln Arg Arg
186 85 90 95

187 Leu Gly Ile Pro Lys Asn Pro Trp Leu Trp Ser Glu Gln Gln Val Cys
188 100 105 110

189 Gln Trp Leu Leu Trp Ala Thr Asn Glu Phe Ser Leu Val Asn Val Asn
190 115 120 125

191 Leu Gln Arg Phe Gly Met Asn Gly Gln Met Leu Cys Asn Leu Gly Lys
192 130 135 140
193 Glu Arg Phe Leu Glu Leu Ala Pro Asp Phe Val Gly Asp Ile Leu Trp
194 145 150 155 160
195 Glu His Leu Glu Gln Met Ile Lys Glu Asn Gln Glu Lys Thr Glu Asp
196 165 170 175
197 Gln Tyr Glu Glu Asn Ser His Leu Thr Ser Val Pro His Trp Ile Asn
198 180 185 190
199 Ser Asn Thr Leu Gly Phe Gly Thr Glu Gln Ala Pro Tyr Gly Met Gln
200 195 200 205
201 Thr Gln Asn Tyr Pro Lys Gly Gly Leu Leu Asp Ser Met Cys Pro Ala
202 210 215 220
203 Ser Thr Pro Ser Val Leu Ser Ser Glu Gln Glu Phe Gln Met Phe Pro
204 225 230 235 240
205 Lys Ser Arg Leu Ser Ser Val Ser Val Thr Tyr Cys Ser Val Ser Gln
206 245 250 255
207 Asp Phe Pro Gly Ser Asn Leu Asn Leu Leu Thr Asn Asn Ser Gly Thr
208 260 265 270
209 Pro Lys Asp His Asp Ser Pro Glu Asn Gly Ala Asp Ser Phe Glu Ser
210 275 280 285
211 Ser Asp Ser Leu Leu Gln Ser Trp Asn Ser Gln Ser Ser Leu Leu Asp
212 290 295 300
213 Val Gln Arg Val Pro Ser Phe Glu Ser Phe Glu Asp Asp Cys Ser Gln
214 305 310 315 320
215 Ser Leu Cys Leu Asn Lys Pro Thr Met Ser Phe Lys Asp Tyr Ile Gln
216 325 330 335
217 Glu Arg Ser Asp Pro Val Glu Gln Gly Lys Pro Val Ile Pro Ala Ala
218 340 345 350
219 Val Leu Ala Gly Phe Thr Gly Ser Gly Pro Ile Gln Leu Trp Gln Phe
220 355 360 365
221 Leu Leu Glu Leu Leu Ser Asp Lys Ser Cys Gln Ser Phe Ile Ser Trp
222 370 375 380
223 Thr Gly Asp Gly Trp Glu Phe Lys Leu Ala Asp Pro Asp Glu Val Ala
224 385 390 395 400
225 Arg Arg Trp Gly Lys Arg Lys Asn Lys Pro Lys Met Asn Tyr Glu Lys
226 405 410 415
227 Leu Ser Arg Gly Leu Arg Tyr Tyr Tyr Asp Lys Asn Ile Ile His Lys
228 420 425 430
229 Thr Ser Gly Lys Arg Tyr Val Tyr Arg Phe Val Cys Asp Leu Gln Asn
230 435 440 445
231 Leu Leu Gly Phe Thr Pro Glu Glu Leu His Ala Ile Leu Gly Val Gln
232 450 455 460
233 Pro Asp Thr Glu Asp
234 465

236 <210> SEQ ID NO: 3
237 <211> LENGTH: 84
238 <212> TYPE: PRT



S 2£ S«; ' _ OT «. - - «-» «» - ™ IM ASP 

242 Ser Gly Pro Ile Gln Leu
243 1 5 10 15



244 Lys Ser Cys Gln Ser Phe Ile Ser Trp Thr Gly Asp Gly Trp Glu Phe
245 20 25 30

246 Lys Leu Ser Asp Pro Asp Glu Val Ala Arg Arg Trp Gly Lys Arg Lys
247 35 40 45

248 Asn Lys Pro Lys Met Asn Tyr Glu Lys Leu Ser Arg Gly Leu Arg Tyr
249 50 55 60

250 Tyr Tyr Asp Lys Asn Ile Ile His Lys Thr Ala Gly Lys Arg Tyr Val
251 65 70 75 80

252 Tyr Arg Phe Val

255 <210> SEQ ID NO: 4
256 <211> LENGTH: 84

257 <212> TYPE: PRT 

258 <213> ORGANISM: Mus musculus 
260 <400> SEQUENCE: 4 



261 Ser Gly Pro Ile Gln Leu Trp Gln Phe Leu Leu Glu Leu Leu Thr Asp
262 1 5 10 15

263 Lys Ser Cys Gln Ser Phe Ile Ser Trp Thr Gly Asp Gly Trp Glu Phe
264 20 25 30

265 Lys Leu Ser Asp Pro Asp Glu Val Ala Arg Arg Trp Gly Lys Arg Lys
266 35 40 45

267 Asn Lys Pro Lys Met Asn Tyr Glu Lys Leu Ser Arg Gly Leu Arg Tyr
268 50 55 60

269 Tyr Tyr Asp Lys Asn Ile Ile His Lys Thr Ala Gly Lys Arg Tyr Val
270 65 70 75 80

271 Tyr Arg Phe Val

274 <210> SEQ ID NO: 5

275 <211> LENGTH: 84 

276 <212> TYPE: PRT 

277 <213> ORGANISM: Gallus gallus 
279 <400> SEQUENCE: 5 



280 Ser Gly Pro Ile Gln Leu Trp Gln Phe Leu Leu Glu Leu Leu Thr Asp
281 1 5 10 15

282 Lys Ser Cys Gln Ser Phe Ile Ser Trp Thr Gly Asp Gly Trp Glu Phe
283 20 25 30

284 Lys Leu Ser Asp Pro Asp Glu Val Ala Arg Arg Trp Gly Lys Arg Lys
285 35 40 45

286 Asn Lys Pro Lys Met Asn Tyr Glu Lys Leu Ser Arg Gly Leu Arg Tyr
287 50 55 60

288 Tyr Tyr Asp Lys Asn Ile Ile His Lys Thr Ala Gly Lys Arg Tyr Val
289 65 70 75 80

290 Tyr Arg Phe Val

293 <210> SEQ ID NO: 6 

294 <211> LENGTH: 84 

295 <212> TYPE: PRT 

296 <213> ORGANISM: Xenopus laevis 

298 <400> SEQUENCE: 6 

299 Ser Gly Pro Ile Gln Leu Trp Gln Phe Leu Leu Glu Leu Leu Thr Asp

300 1 5 10 15